The TELLTALE Dynamic Hypertext Environment : Approaches to

نویسندگان

  • Claudia Pearce
  • Ethan Miller
چکیده

Methods and tools for nding documents relevant to a user's needs in document corpora can be found in the information retrieval, library science, and hypertext communities. Typically, these systems provide retrieval capabilities for fairly static corpora, their algorithms are dependent on the language for which they are written, e.g. English, and they don't perform well when presented with misspelled words or text that has been degraded by OCR (optical character recognition) techniques. In this paper, we present the TELLTALE system. TELLTALE is a dynamic hypertext environment that provides full-text search from a hypertext-style user interface for text corpora that may be garbled by OCR or transmission errors, and that may contain languages other than English by using several techniques based on n-grams (n character sequences of text). In this paper, we identify methods and techniques that we have applied to the n-gram data structures and algorithms to enhance the scalabilty of the TELLTALE Dynamic Hypertext System.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The TELLTALE Dynamic Hypertext Environment: Approaches to Scalability

Methods and tools for nding documents relevant to a user's needs in document corpora can be found in the information retrieval, library science, and hypertext communities. Typically, these systems provide retrieval capabilities for fairly static corpora, their algorithms are dependent on the language for which they are written, e.g. English, and they don't perform well when presented with missp...

متن کامل

TELLTALE: Experiments in a Dynamic Hypertext Environment for Degraded and Multilingual Data

Methods and tools for finding documents relevant to a user’s needs in document corpora can be found in the information retrieval, library science, and hypertext communities. Typically, these systems provide retrieval capabilities for fairly static corpora, their algorithms are dependent on the language for which they are written, e.g. English, and they do not perform well when presented with mi...

متن کامل

Performance and Scalability of a Large-Scale N-gram Based Information Retrieval System

Information retrieval has become more and more important due to the rapid growth of all kinds of information. However, there are few suitable systems available. This paper presents a few approaches that enable large-scale information retrieval for the TELLTALE system. TELLTALE is a dynamic hypertext information retrieval environment. It provides full-text search for text corpora that may be gar...

متن کامل

Valuation Links: Formally Extending the Computational Power of Hypertext

We view hypertext as an inherently dynamic concept to incorporate in the interface of dynamic information systems. What challenges does hypertext face in a constantly changing environment? In this paper, we discuss the benefits and the problems we face in our research into hypertext-oriented decision support systems. Then we focus on a new hypertext construct beneficial to this domain: valuatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997